Goto

Collaborating Authors

 competition format


PD$^3$: A Project Duplication Detection Framework via Adapted Multi-Agent Debate

Bao, Dezheng, Yang, Yueci, Chen, Xin, Jiang, Zhengxuan, Fei, Zeguo, Zhang, Daoze, Huang, Xuanwen, Chen, Junru, Yu, Chutian, Yuan, Xiang, Yang, Yang

arXiv.org Artificial Intelligence

Project duplication detection is critical for project quality assessment, as it improves resource utilization efficiency by preventing investing in newly proposed project that have already been studied. It requires the ability to understand high-level semantics and generate constructive and valuable feedback. Existing detection methods rely on basic word- or sentence-level comparison or solely apply large language models, lacking valuable insights for experts and in-depth comprehension of project content and review criteria. To tackle this issue, we propose PD$^3$, a Project Duplication Detection framework via adapted multi-agent Debate. Inspired by real-world expert debates, it employs a fair competition format to guide multi-agent debate to retrieve relevant projects. For feedback, it incorporates both qualitative and quantitative analysis to improve its practicality. Over 800 real-world power project data spanning more than 20 specialized fields are used to evaluate the framework, demonstrating that our method outperforms existing approaches by 7.43% and 8.00% in two downstream tasks. Furthermore, we establish an online platform, Review Dingdang, to assist power experts, saving 5.73 million USD in initial detection on more than 100 newly proposed projects.


Simulation leagues: Analysis of competition formats

Budden, David, Wang, Peter, Obst, Oliver, Prokopenko, Mikhail

arXiv.org Artificial Intelligence

The selection of an appropriate competition format is critical for both the success and credibility of any competition, both real and simulated. In this paper, the automated parallelism offered by the RoboCupSoccer 2D simulation league is leveraged to conduct a 28,000 game round-robin between the top 8 teams from RoboCup 2012 and 2013. A proposed new competition format is found to reduce variation from the resultant statistically significant team performance rankings by 75% and 67%, when compared to the actual competition results from RoboCup 2012 and 2013 respectively. These results are statistically validated by generating 10,000 random tournaments for each of the three considered formats and comparing the respective distributions of ranking discrepancy.

  Country:
  Genre: Research Report (1.00)
  Industry: Leisure & Entertainment > Sports > Soccer (1.00)